CDS

Accession Number TCMCG075C20679
gbkey CDS
Protein Id XP_017979493.1
Location join(25344464..25344573,25345449..25345533,25345783..25345844,25345939..25346150,25346261..25346324,25346566..25346638,25346777..25346897,25347539..25347626,25347716..25347841,25348038..25348147,25348540..25348712)
Gene LOC18597157
GeneID 18597157
Organism Theobroma cacao
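
The Location field above lists the exon segments of the CDS as a join(...) of inclusive, 1-based coordinate ranges. As a minimal sketch (the helper name `parse_join` is hypothetical and not part of any database tooling), the segment lengths can be summed to check that the spliced CDS length is a multiple of three and agrees with the reported protein length:

```python
import re

def parse_join(location):
    """Return (start, end) pairs from a GenBank-style join(...) location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

# Location string copied from the record above.
location = (
    "join(25344464..25344573,25345449..25345533,25345783..25345844,"
    "25345939..25346150,25346261..25346324,25346566..25346638,"
    "25346777..25346897,25347539..25347626,25347716..25347841,"
    "25348038..25348147,25348540..25348712)"
)

segments = parse_join(location)

# Coordinates are inclusive, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)

# One codon is the stop codon, so the protein has one residue fewer than codons.
protein_length = cds_length // 3 - 1

print(len(segments), cds_length, protein_length)  # 11 1224 407
```

The result of 407 residues matches the Length field in the Protein section, assuming the final codon is a stop codon.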

Protein

Length 407aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018124004.1
Definition PREDICTED: probable beta-1,3-galactosyltransferase 1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyltransferase 31 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko01000, ko01003
KEGG_ko ko:K20855
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCTGTGAAGAGTAGGGGAGAGCTTGCTGCTAAGCATGTTTTATCAAGGAACTTGGCTCTCTTGCTTTGTTTTGCTAGCTTCTGTGCAGGGATGTTCTTCACCAATAGGATGTGGATGCTGCCTGATGCCAAAGGCATTCCGAGGACATCTAGAATCGGGGTTGAACAATCTCTGAACTGTGATAAAAAAATTAAGGCCTTAAACAATGAAGCCAATAGCTCTGGAGGCAGTTCAGGGTCCCAACATTCTATTCAGACTCTGGATAGAGCCATTTCAGATTTGGAGATGAAAATAGTGGCTGCTAGGGCAGAGCGTGAGACAATTATGAAAGACCCTATTATATCAGAAGACTTGAAGAATGTTAAATCAACCTTAAAAAGAAAATATTTTATGGTCATAGGAATCAATACAGCTTTTAGTAGCCGTAAGCGGAGAGATTCAGTGCGTGCAACTTGGATGCCTCAAGCTGAGAAGCGAAAAAAATTGGAGGAAGAGAAAGGCATCATTATTCGCTTTGTTATAGGTCACAGTTCAACATCAGGTGGTATTCTTGATAAAGCCATTGAAGCAGAGGAAAAGGTGCATGGAGACTTTTTGAGATTGCAACACATTGAGGGCTATCTGGAGTTGTCAGCCAAGACAAAAACTTATTTTGCCACTGCTGTTTCCTTGTGGGATGCGGAATTTTATGTCAAAGTTGATGATGATGTTCATGTAAATCTAGCAACACTTGGTTCGACTTTAGCTGGACATAGTAATAAACCTCGAGTTTATATCGGCTGCATGAAGTCTGGTCCTGTTCTTGCTCGAAAGGGAGTGAAATACCATGAACCTGAGTACTGGAAATTTGGTGAAGTTGGAAACAAATATTTTCGACATGCTACAGGGCAACTGTATGCTATATCAAAAGATTTGGCCACTTACATATCAATAAATCAGAATGTACTGCATAAATATGCTAATGAAGATGTTTCATTGGGATCTTGGTTTATCGGTTTAGATGTGGAGCATGTTGATGATAGGAGACTCTGCTGTGGTACTCCACCAGATTGTGAATGGAAAGCTCAAGCTGGTAACATCTGTGTCGCATCTTTTGACTGGAGGTGCAGTGGGATTTGCAGGTCTGTAGAGAGGATCATAGAAGTTCATGAACGTTGTGGTGAGGACAAGAATGCTTTATGGAGCACAAACTTTGTGCAAACAACAAGCAGTTCTTTCTGA
Protein:  
MSVKSRGELAAKHVLSRNLALLLCFASFCAGMFFTNRMWMLPDAKGIPRTSRIGVEQSLNCDKKIKALNNEANSSGGSSGSQHSIQTLDRAISDLEMKIVAARAERETIMKDPIISEDLKNVKSTLKRKYFMVIGINTAFSSRKRRDSVRATWMPQAEKRKKLEEEKGIIIRFVIGHSSTSGGILDKAIEAEEKVHGDFLRLQHIEGYLELSAKTKTYFATAVSLWDAEFYVKVDDDVHVNLATLGSTLAGHSNKPRVYIGCMKSGPVLARKGVKYHEPEYWKFGEVGNKYFRHATGQLYAISKDLATYISINQNVLHKYANEDVSLGSWFIGLDVEHVDDRRLCCGTPPDCEWKAQAGNICVASFDWRCSGICRSVERIIEVHERCGEDKNALWSTNFVQTTSSSF
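
The relationship between the CDS and Protein sequences above can be illustrated with a small translation sketch. It assumes the standard genetic code (NCBI translation table 1) and uses a hypothetical `translate` helper written here for illustration; it is not the tool that produced this record. Only short excerpts from the start and end of the CDS are translated:

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nucleotides of the CDS above.
print(translate("ATGTCTGTGAAGAGTAGGGGAGAGCTT"))  # MSVKSRGEL

# Last 15 nucleotides of the CDS above, ending in the TGA stop codon.
print(translate("AGCAGTTCTTTCTGA"))  # SSSF
```

Both excerpts agree with the corresponding ends of the Protein sequence (MSVKSRGEL... and ...SSSF).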